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(54) Gated-clock registers for low-power circuitry 



(57) A method and apparatus for analyzing each mi- 
croinstruction in a microinstruction-based electronic cir- 
cuit having a plurality of registers to determine which 
registers in a processing cycle are not involved in the 
processing cycle, and preventing those registers from 



being clocked during such processing cycle. Hence, in- 
active registers during a processing cycle do not con- 
sume power at the level of active registers, thus lowering 
overall power usage by any system employing such gat- 
ed-clock registers. 
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Description 

BACKGROUND OF THE INVENTION 

5 1. Field of the Invention 

This invention relates to electronic circuitry, and more particularly to a method and apparatus for reducing power 
usage by registers in low-power circuits. 

io 2. Description of Related Art 

Low power electronic circuits are commonly used in battery-operated products such as cellular telephones, cal- 
culators, computers, etc. Much effort has been devoted by the electronics industry to provide greater functionality for 
such circuits while reducing power requirements. 

is Various types of low power circuits use a plurality of registers (also known as register files) to store multiple bits 

of data. For example, a microprocessor or microcontroller circuit often has multiple registers for storing data values 
and instructions. FIGURE 1 is a block diagram of a typical prior art microprocessor showing a set or bank of 8 registers 
1 are coupled to input selectors 2 and output selectors 3 as shown. Other elements, such as multiplexers 4, 5, barrel 
shifter 6, and arithmetic logic unit 7, are coupled in a loop to the inputs selectors 2 and output selectors 3 for the 

20 registers 1 . Typically, one to three registers are involved in any one processing cycle. A microinstruction controlling a 
particular processing cycle specifies which registers are involved. 

Registers are generally designed to be fast and static (i.e., do not require refreshing), and thus generally comprise 
a plurality of static storage elements, each element comprising a plurality of transistor elements. Static storage elements 
can consume a fairly large amount of power. It would be desirable to reduce power consumption by registers used in 

25 low power circuits. 

The present invention provides a method and apparatus for accomplishing this goal. 

SUMMARY OF THE INVENTION 

30 The invention is based on the observations that (1) a characteristic of static storage elements is that they draw 

some power when clocked (/.a, when a synchronizing signal is applied to each element to transfer input to output, if 
the element is enabled), but draw substantially more power when the data within the elements is changing instead of 
static, and (2) not al) registers are used in each processing cycle. Accordingly, the present invention analyzes each 
microinstruction to determine which registers in a processing cycle are not involved in the processing cycle, and pre- 

35 vents those registers from being clocked during such processing cycle. Hence, inactive registers during a processing 
cycle do not consume power at the level of active registers, thus lowering overall power usage by any system employing 
such gated-clock registers. 

The details of the preferred embodiment of the present invention are set forth in the accompanying drawings and 
the description below. Once the details of the invention are known, numerous additional innovations and changes will 
40 become obvious to one skilled in the art. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 is a block diagram of a typical prior art microprocessor showing a set of 8 registers coupled to input 
45 selectors and output selectors. 

FIGURE 2 is a block diagram showing the preferred embodiment of the invention. 

Like reference numbers and designations in the various drawings indicate like elements. 

DETAILED DESCRIPTION OF THE INVENTION 

so 

Throughout this description, the preferred embodiment and examples shown should be considered as exemplars, 
rather than as limitations on the present invention. 

Overview 

55 

A characteristic of static storage elements is that they draw some power when clocked (i.e., when a synchronizing 
signal is applied to each element to transfer input to output, if the element is enabled), but draw substantially more „ 
power when the data within the elements is changing instead of static. In conventional circuitry using registers (e.g., 
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microprocessors or microcontrollers), all registers are clocked even when no change is to be made to the contents of 
each register. Further, not all registers are used in each processing cycle. Accordingly, the present invention analyzes 
each microinstruction to determine which registers in a processing cycle are not involved in the processing cycle, and 
prevents those registers from being clocked during such processing cycle. (As used herein, "microinstruction" means 

5 any collection of bits controlling processing cycles). 

FIGURE 2 is a block diagram showing the preferred embodiment of the invention. A set of registers 20, similar to 
the register bank 1 shown in FIGURE 1 , have their clock inputs 21 coupled through corresponding AND gates 26 to a 
clock signal from a clock source 24. Also coupled to each AND gate 26 is decode logic 28, which in turn is coupled to 
an instruction source 30 (which may be, for example, a ROM). While FIGURE 2 shows the functional aspects of the 

10 invention most clearly, in practice it may be preferred to directly couple the clock source 24 to the decode logic 28, and 
incorporate the AND gates 26 directly into the decode logic 28. 

The decode logic 28 determines which of the registers 20 are involved in each processing cycle by examining the 
appropriate fields or bit positions of the microinstruction controlling that particular processing cycle. That is, each mi- 
croinstruction has at least one indication of which of the plurality of registers are active during each processing cycle. 

15 if a register is not involved in a processing cycle, then the decode logic 28 blocks the clock signal from the clock source 
24 from being applied to the clock input 21 of that register 20 through the corresponding AND gate 26. (In the preferred 
embodiment, the output control signal of the decode logic 28 is synchronized to the falling edge of the clock to avoid 
clock "glitches" or noise). Conversely, if a register is involved in a processing cycle, then the decode logic 28 allows 
the clock signal from the clock source 24 to pass to the clock input 21 of that register 20. Hence, unused (inactive) 

20 registers during a processing cycle do not consume power at the level of used (active) registers, thus lowering overall 
power usage by any system employing such gated-clock registers. 



Example 



25 The following commented code sets forth one example of the invention. The code may be directly compiled to 

generate circuitry that embodies the functionality of the decode logic 28, using, for example, "Design Compiler" from 
Synopsys, Inc., Mountain View, California. However, one of skill in the art will recognize that the decode logic 28 may 
also be implemented in conventional combinatorial logic, as a programmable logic array, or in equivalent circuitry. 



30 

Entity which decodes the instruction word to determine which 

register (if any) is to be updated baaed on the current: 

instruction 

library IEEE; 
35 use IEEE. std_logic_ll€4 .all; 



40 



entity reg_ updace_control is 
pore ( 

-- Update specified register if logic 1: 
update_reg_a : out std_logie ; 
updace_reg_b : out std_logic? 
update_reg_c - out std_logic; 
update_reg_d : out std_logic; 
update_reg_e ; out std_logic; 
45 update_reg_f : out std_logic; 

update^ reg_g : out std_logic; 
update_reg_h : out std^logic; 
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) ; 



Current processor instruction: 
instruction : in std_logic_vector (23 downto 0) ; 

Result of instruction condition check, 
condition : in std_logic 



end reg_update_control; 

architecture behave of reg_update_control is 

■- Declaration of register mapping in instruction fields: 



constant REG I STERNA 
constant REGISTER_B 
constant REGISTER_C 
constant REGISTERED 
constant REG I STERNE 
constant REGISTER~F 
constant REGISTER^ G 
constant REGISTER H 



std_logic_vector (2 downto 0) 

std_logic_vector (2 downto 0) 

std_ log ic_ vector (2 downto 0) 

std_logic_ vector (2 downto 0) 

std_ logic_vector (2 downto 0) 

std — logic_yector (2 'downto 0) 

std_logic_vector (2 downto 0) 

std_logic_vector (2 downto 0) 



•« 0 0 o M 
"001" 
"010" 
M 011" 
"100 M 

"110 M ; 
"HI"; 



Declaration of instruction format: 
Format 1 allows register updacing 
Format 2 does not 

constant INSTR_F0RMAT_1 : std_logic := * 0'; 

constant INSTR^FORMAT^ : std~logic := 1 1 • ; 

Explanatory constants to enhance readability? 
constant DONT — UPDATE : std_logic :« '0'; 
constant C0NDITI0N_MET : std_logic := '1'; 



Local signals: 
signal target_reg 
signal update_bit 
signal update_reg 



std_logic_vector (2 downto 0) ; 

std_logic; 

std_logic; 



begin 

The target register is read from the current instruction 
in format 1 commands and defaults to register A otherwise. 

target_reg <= ins cruet ion (2 downto 0) 

when (instruction (23) = ltfSTR_FORMAT_l ) 

else REG I STERNA; 

— The update bit is read from the current instruction 

--in format 1 commands and defaults to "don't update" otherwise 

update^bit <«= instruction (3 ) 

when (instruction (23) =■ INSTR_F0RMAT_1 ) 
else DONT UPDATE; 
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A. register will update if the bit in the instruction is set, 

and the condition for execution was met (this is evaluated 

elsewhere) 
update_reg update^bit 

when (condition -""CONDITION MET) 
else DONT_UPDATE; 

The individual update controls combine the overall update 

control and the target register obtained earlier; 

note that all control signals are combinational and 

must be stabilized elsewhere 
update_reg_a <= update_reg when 
else DONT_UPDATS; 
update_reg_b <«= Update^reg when 
else DONT_UPDATE; 
update_reg_ c <=» update_reg when 
else DONT_UPDATE; 
update_reg_d «;* update_reg when 
else DONT^ UPDATE ; 
update_reg_e <= update^reg when 
else DONT — UPDATE ; 
updafce_reg_£ <:= update_reg when 
else DONT_UPDATE; 
update_reg_g <«» update_reg when 
else DONT_UPDATE; 
update_reg^Ja <= update__reg when 
else DONT_UPDATE; 
end behave; 

The following commented code sets forth one example of a register 20, including the AND gate 26, as well as the 
circuit that synchronizes the decode logic 28 to the falling edge of the clock. 

Entity which implements a 16 bit data register with an update 

control supplied by a module which decodes an instruction 

library IEEE; 

use IEEE . std_logic_1164 . all; 

entity data_regiater is 
port ( 

register value: 
data_value ; out atd_logic_vector ( is downto 0) ; 

new value co load: 
next_value : in std_logic_vector (IS downto 0) ; 

control signal from reg_update_control : 
do_update : in otd_logic ; 



( targe t_reg 




REGISTER, 


_A) 


(target_reg 




REGISTER^ 


_B) 


( targe t^reg 




REGISTER_C> 


<target_reg 




REGISTERED) 


(target_reg 




REG I STERNE) 


(target_reg 




REGISTER^ 


J) 


(target_reg 


ss 


REGISTER^ 


G) 


(target_reg 




REGISTER^ 


H) 



EP 0 855 639 A1 



system clock i 
clock : in std_logic; 

-- active low asynchronous reset: 
reset_b : in std logic 

> ; 

end data_register; 

architecture behave of data^register is 

Explanatory constants to enhance readability: 
constant RESET_VALUE : std_logic_vector (15 downto 0) 

: = (others « ' o 1 ) 
constant CLOCKjFALIjING : std_logic :« ' 0'; 
constant CLOCKJ&ISING : std_logic ' 1'; 

- - Local signals : 

signal stable_update : std^logic; 
signal masked_^clock : scd_logic; 

signal regis ter_output : std_logic_vector ( 15 downto a); 
begin 

The clock to the register which stabilizes the update signal 
operates on the system clock falling; this is because the 
instruction is fetched on the system clock rise, so the update 

• - controls are unstable during system clock high; the information 
needed to determine whether to fire the register is needed 
around the subsequent system clock high, and is 
thus optimally maintained by a falling edge register. 
During reset, register updates are disabled; it requires the 

— assertion of the update control from the reg_update_control 
module to enable the clock to the main register bank- 

process (clock, reset — b) 
begin 

if reset_b « '0' then 

stable_update <= »0 f ; 
elsif (clock* event and clock = CLOCK_FALIiINO) then 

stable^update do_update; 

end if; 
end process; 

The clock to the main register bank is enabled only when the 
stabilized control signal is active high; in this event, the 
system clock is passed to the data registers themselves. Since 
the stabilized control signal is a constant value from falling 

-- edge to falling edge of the system clock, there is no danger of 
introducing glitches. 

masked^ clock <= stable_update and clock; 
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The data is held in a bank of registers as in prior art, except 
that the clock is controlled in the manner described above. 

process (mag ked_e lock , resetja) 

begin 

if reset — b * ■ 0* then 

regis ter_output RESET_VAIjUE; 
elsif <masked_clock' event and masked_clock CLOCK_RISINO) then 

regiscer__output <» next^value; 
end if; 
end process; 

The register value is the data output from the module: 
data_value <= register_output; 
end behave; 

Power savings using the Invention depends on the implementing semiconductor technology, the style of the storage 
element circuitry comprising each register, and the width of each register. In one embodiment in which the invention 
was used for a 16-bit register, power savings from clocking only active registers was about 20% to 25%. For a smaller 
register width, the percentage saved would be lower, while for a wider register, the percentage saved is potentially 
considerably higher. 

A number of embodiments of the present invention have been described. Nevertheless, it will be understood that 
various modifications may be made without departing from the spirit and scope of the invention. Accordingly, it is to be 
understood that the invention is not to be limited by the specific illustrated embodiment, but only by the scope of the 
appended claims. 



Claims 

1 . A circuit for controlling power usage in an electronic system (1 ) having a processing cycle synchronized by a clock 
signal from a clock source and (2) utilizing a plurality of registers clocked by the clock signal and under control of 
microinstructions, each microinstruction having at least one indication of which of the plurality of registers are 
active during each processing cycle, the circuit comprising: 

(a) a plurality of control gates, each having an output coupled to a clock input of a corresponding registers 
and an input coupled to the clock signal from the clock source; 

(b) decode logic, having an input coupled to a source of microinstructions and a plurality of output, each coupled 
to a corresponding control gate, for determining, from the at least one indication of which of the plurality of 
registers are active during each processing cycle, which of the plurality of registers will be inactive during each 
processing cycle, and for blocking the clock signal through the control gate of each such inactive register, 
thereby controlling power usage of the system. 

2. A method for controlling power usage in an electronic system (1) having a processing cycle synchronized by a 
clock signal from a clock source and (2) utilizing a plurality of registers clocked by the clock signal and under control 
of microinstructions, each microinstruction having at least one indication of which of the plurality of registers are 
active during each processing cycle, the method comprising the steps of: 

(a) determining, from the at least one indication of which of the plurality of registers are active during each 
processing cycle, which of the plurality of registers will be inactive during each processing cycle; and 

(b) blocking the clock signal to each such inactive register, thereby controlling power usage of the system. 
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